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BLIND EQUALIZERS USING PROBABILITY DENSITY 
MATCHING AND PARZEN WINDOWING 

CROSS REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. Provisional Application No. 
60/459,287, filed in the United States Patent and Trademark Office on March 31, 2003, 
the entirety of which is incorporated herein by reference. 

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH 

[0002] This invention was made with United States Government support under grant 
number ECS-9900394 awarded by the National Science Foundation. The United States 
Government has certain rights in the invention. 

BACKGROUND 

Field of the Invention 

[0003] The present invention relates to the field of communications and, more 
particularly, to equalization for digital communication channels. 

Description of the Related Art 

[0004] Channel equalization plays a key role in digital communication systems. 
Typically, the physical channel introduces a distortion to the transmitted signal that can 
make it difficult to recover the original data. An equalizer can reduce, or ideally 
completely eliminate, the introduced intersymbol interference (ISI). Conventional 
equalization techniques rely on the transmission of a reference, or training, sequence that 
is known at the equalizer. This sequence allows adaptation of the equalizer parameters to 
minimize some cost function that measures the distance between the actual equalizer 
output and the desired reference signal. For instance, when the equalizer is implemented 
by means of a linear filter, the filter coefficients can be adapted using least mean squares 
analysis to minimize the expectation of the squared error. This type of equalization is 
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referred to as supervised equalization. 

[0005] When a training or reference sequence is not available at the receiver, blind 
equalization can be used. Without a reference sequence, the only knowledge about the 
transmitted sequence is limited to its probabilistic or statistical properties. Under this 
constraint, blind equalization typically minimizes a cost function that is able to indirectly 
extract the higher order statistics of the signal or the current level of ISI at the equalizer 
output. Typically, the cost function is minimized by means of a stochastic gradient 
algorithm. Examples of this kind of algorithm can include the Sato algorithm as 
disclosed in Y. Sato, "A method of self-recovering equalization for multilevel amplitude 
modulation," IEEE Transactions on Communications, vol. COM-23, pp. 679-682 (1975); 
and the Godard algorithms as disclosed in D. N. Godard, "Self-recovering equalization 
and carrier tracking in two dimensional data communication systems," IEEE 
Transactions on Communications, vol. 28, pp. 1867-1875, (Nov. 1980). 
[0006] One significant disadvantage of blind equalization techniques such as those 
noted herein is their need for a high number of data symbols to achieve convergence to a 
good solution. The high number of data symbols is necessary because the criterion for 
optimization is unable to exploit the high order statistics present in the data. Following 
an initial approximate equalization stage, which could be either supervised or blind, 
typically decision-directed equalization is employed. Decision-directed equalization 
utilizes discretized versions of the equalizer outputs as desired reference signal values. 
The discretization is typically done by approximating the current value of the output with 
the nearest value from the symbol alphabet called the symbol constellation. 
[0007] Other techniques attempt to improve the convergence speed of conventional 
blind equalizers and utilize higher order statistics. For example, Renyi's entropy, as 
disclosed in I. Santamaria, et al, "A fast algorithm for adaptive blind equalization using 
Renyi's entropy," in Proceedings of the International Conference on Acoustics, Speech, 
and Signal Processing, vol. Ill, Orlando, FL, USA, 2002, pp. 2657-2660, has been used 
as a cost function for blind equalization of constant modulus signals. This approach is an 
application of information theoretic criteria to equalization and uses the Parzen window 
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method (a nonparametric method) to estimate the underlying probability density function. 
A discussion of information theoretic criteria can be found in J. Principe, et aL, "Learning 
from examples with information theoretic criteria," Journal of VLSI Signal Processing, 
vol. 26, pp. 61-77, 2000. Although this method can provide acceptable results for some 
communication channels, Renyi's entropy does not produce acceptable results for ill- 
conditioned channels, or those having zeros at or very close to the unit circle. Further, 
this technique is sensitive to noise. 

[0008] What is needed is an equalization technique that is applicable to any symbol 
constellation standard and which provides improved performance over existing 
equalization techniques. 

SUMMARY OF THE INVENTION 
[0009] The inventive arrangements disclosed herein provide a method, system, and 
apparatus for a blind equalization technique for digital communication channels. The 
present invention can be applied to any symbol constellation used for digital 
communications, including, but not limited to, phase-shift-keying, quadrature amplitude 
modulation, or other types of discrete alphabets. In accordance with the inventive 
arrangements disclosed herein, the number of samples required to blindly identify the 
channel inverse for equalization can be reduced. As such, increased efficiency with 
respect to the use of transmission bandwidth in current digital communication systems 
where channel equalization is necessary can be achieved. 

[0010] The present invention uses a novel cost function, or optimization criterion, 
based on the distance between an estimate of the probability density function (PDF) of 
the received symbols and of the constellation. The estimate of the PDF is obtained 
directly from the received symbol data using pairwise interactions among kernels 
centered on the received data. It should be appreciated that while Gaussian kernels have 
been used herein, any of a variety of kernel types can be used. Accordingly, Gaussian 
kernels have been used for purposes of illustration only and, as such, should not be 
construed as a limitation of the present invention. 
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[0011] The present invention also utilizes a kernel size parameter to facilitate a soft 
switching from a blind equalization mode to a decision-directed equalization mode. By 
automatically switching between the two modes, the present invention can attain 
improved equalization results without adding complexity over conventional equalization 
techniques. 

[0012] One embodiment of the present invention can include an iterative method of 
equalizing an input signal received over a digital communication channel. The method 
can include (a) using a kernel density estimate where different values of the kernel size 
are indicative of either a blind or a decision-directed equalization mode, (b) processing a 
received signal using a blind equalization mode, and (c) evaluating, on a block or sample 
basis, an error measure based on the distance among a distribution of the equalizer output 
and a constellation. The method also can include (d) updating the kernel size based upon 
the error measure thereby facilitating automatic switching between the blind and 
decision-directed equalization modes, where the kernel size is initially set to a value 
indicative of the blind equalization mode. The method additionally can include (e) 
selectively applying blind equalization or decision-directed equalization to the input 
signal according to the updated kernel size for subsequent iterations of steps (c)-(e). 
[0013] Other embodiments of the present invention can include a system having 
means for performing the various steps disclosed herein as well as a machine readable 
storage for causing a machine to perform the steps described herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0014] There are shown in the drawings, embodiments which are presently preferred, 
it being understood, however, that the invention is not limited to the precise arrangements 
and instrumentalities shown. 

[0015] FIG. 1 is a schematic diagram illustrating one embodiment of a system in 
accordance with the inventive arrangements disclosed herein. 

[0016] FIG. 2 is a graph illustrating the smoothing effect of kernels in the operation 
of a channel equalizer in accordance with the inventive arrangements disclosed herein. 
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[0017] FIG. 3 is a graph illustrating the form of the correction factor function F(a) 
for three different criteria in accordance with the inventive arrangements disclosed 
herein. 

[0018] FIG. 4 is a flow chart illustrating a method of selectively applying blind 
equalization or decision-directed equalization to an input signal in accordance with one 
embodiment of the present invention. 

[0019] FIG. 5 is a flow chart illustrating a method of selectively applying blind 
equalization or decision-directed equalization to an input signal in accordance with 
another embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[0020] FIG. 1 is a schematic diagram illustrating one embodiment of a system 100 in 
accordance with the inventive arrangements disclosed herein. As shown, the system 100 
can include a communication channel 105 and a channel equalizer 110. 
[0021] The communication channel 105 can be any network communication channel 
for digital communications including, but not limited to, wired Local Area Networks 
(LAN's), Digital Subscriber Line (DSL), cable networks, telephone lines for conventional 
modems, and the like. It should be appreciated that the inventive arrangements disclosed 
herein can be applied to both linear and non-linear communication channels. For 
purposes of illustration only, however, a linear channel has been assumed. Using a 
complex baseband representation, the communication channel 105 can be represented as: 

(1) x k =^h n s k _ n +e k 

where the transmitted symbol sequence sn, or input signal 115, is assumed to take iid 
complex values from a discrete finite alphabet of symbols referred to as the constellation 
of the digital modulation. In equation (1), the sequence h n represents the complex-valued 
impulse response of the communication channel 105 and represents the additive 
channel noise for the input signal 115 that is sent through the communication channel 
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105, resulting in signal x k , or signal 118. The sequence h n can be represented as infinite 
impulse response (IIR) as well. 

[0022] The channel equalizer 1 10 can be disposed within a receiving device such as a 
modem, whether a cable modem, a DSL modem, or a conventional telephony modem, or 
other communication device. As such, it should be appreciated that the channel equalizer 
1 10 can be disposed within a larger information processing system. In any case, an input 
signal 1 15 can be transmitted via the communication channel 105. The channel equalizer 
110 operates on the received signal 118 to produce an output signal 120 which can be 
denoted as j/*. 

[0023] The output signal yt can be represented as follows: 

l w -\ 

(2) y k = Y J w nXic-n=™ T x 

where w represents a vector of equalizer coefficients to be adapted by the blind 
equalization technique to minimize inter-symbol interference (ISI). As a blind 
equalization technique assumes that no training sequence will be available for adaptation, 
the blind equalization technique exploits other properties of a received symbol sequence 
or signal 118. 

[0024] The proposed approach for blind equalization exploits the fact that the 
constellation of the transmitted symbol sequence is known. Specifically, the constellation 
can be modeled as a probability distribution of a number of Dirac-delta functions located 
at possible symbol values in the constellation. To reduce computational complexity, the 
symmetry properties of the constellation can be exploited. 

[0025] In one embodiment, for phase-shift-keying (PSK) and quadrature amplitude 
modulation (QAM) constellations, the constellation distribution can be defined over the 
possible complex-magnitude or complex values that the symbols can take rather than the 
actual complex values. One example is the probability distribution function of the 
random variable $F whose samples are given by \sjf where |.| denotes the complex- 
magnitude of the symbol from the constellation and the superscript p denotes the p ih - 
power. Another example is the probability distribution function of the random variable 
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5^ whose samples are give by the original complex values s, of a constellation. For 
purposes of illustration, the probability distribution function of the random variable S* 7 
whose samples are given by \s,f has been used. Still, it should be appreciated that the 
present invention is not so limited. 

[0026] The channel equalizer 110 attempts to match the probability distribution of the 
channel equalizer's 1 10 output signal 120 to that of the constellation symbols S. This can 
be achieved by optimizing the divergence between the distributions of these variables. 
For instance, the quadratic (Euclidean) distance between f yP (.) and f SP (.) , which is 

given by 

(3) J(Y,)=£jf YP (z)-f SP (z)) 2 dz 

can be minimized. While the quadratic distance is illustrated herein, it should be 
appreciated that any of a variety of different algebraic and/or information theoretic 
distance measures can be used. As such, the use of the quadratic distance should not be 
construed as a limitation of the present invention. 

[0027] The probability distributions required for evaluating equation (3), which can 
be referred to as the Quadratic Distance, can be estimated using Parzen windowing on 
samples of the output, for example the most recent samples, using equation (4). 

(4) h^ z )=jT!::l G ^-\yk- i \ p ) 

In equation (4), GJ^) represents the kernel function, which typically is a zero-mean 
probability distribution, with a denoting the size of the kernel function. Notably, if the 
kernel function is a Gaussian distribution, the kernel size can be controlled by its standard 
deviation. Similarly, the distribution of S* 7 can be estimated using the symbols 

,... 9 s N } in the constellation with equation (5). 




[0028] Substituting equations (4) and (5) in equation (3), equation (6) below can be 
obtained. 
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Aw) =^I,toI>>»(i y t -, i" -\y*-i i') 
(«) +^2£2S,G»<i'/i'-i*/i'> 

[0029] The weight vector of the equalizer can be optimized using the following 
stochastic gradient expression, denoted as equation (7), for minimizing the criterion in 
equation (6), where p=2 and L=2 for illustration purposes. 

V w J(w) = ^G;(|^ I 2 -\y k _i \ 2 )(y k * k -j/jfc_i x *-i) 

--^SS^flj'* I 2 -I*/ l 2 )^ i 2 i 2 )^_,x;_,] 

In equation (7), * denotes the element- wise complex-conjugate operation for the input 
vector x and G' a (.) denotes the derivative of the kernel function with respect to the 
argument of the kernel function. 

[0030] In another embodiment, an alternative algorithm can be obtained by 
approximating the divergence measure definition such as the one specified in equation (3) 
with a summation of selected critical points. For a digital communication symbol 
constellation, the critical points can be selected as the symbol values or the possible 
symbol-magnitude values. Denoting these critical points with r, and the desired density 
value at these points with T h equation (3) can be simplified to 

(8) jw-L^(f Ypiri) - Ti Y 

iy p 

1 \-^N -1 

where 7} = 2j = 0 ^ ^ ~ ' S J ' ' anc * ^ ^ ^ e num b er °f tar g et points considered, 

N s J 

which can be referred to as the Sampled probability density function (PDF) criterion. If 
r, = \sif, then F, = G^O). In this latter special case, with p = 2, a suitable stochastic 
gradient can be expressed as equation (9). 
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(9) v w y(w) = -^^[G <7 (|^ | 2 -r^-G^owAlyk I 2 -n)y k 4 
m P 

[0031] In yet another embodiment, another simplification to equation (3) can be to 
drop the integral-squared-distribution terms from the expansion of the parenthesis-square 
in the integral, which are essentially normalization terms. In such an embodiment, a 
similarity criterion that is the correlation of the probability distributions of the output 
signal 120 and the original constellation is provided, rather than divergence. 
Accordingly, the correlation would require maximizing. Such a criterion, which can be 
referred to as the Matched PDF, can be represented by equation (10). 

m ~Z£l^i<».aj'*-yi'-i"i') 

[0032] A stochastic gradient for this criterion where p = 2 and L = 1 can be expressed 
as equation (11). 

(11) V w 7(w) = -L£^g;(| y k | 2 -\ Si \ 2 )y k x k 

™ s 

[0033] The updates for the weight vector can be in the form of steepest 
descent/ascent depending on the specific criterion assumed. For example, the weight 
vector can be represented with equation (12). 

(12) w M =w k ±/^V w y(w£). 

In the case of a Gaussian kernel, for example, the kernel size dependent step size, or 
learning rate, can be selected to be /u a = //<j 3 , where 0 < pi « 1 is a small fixed constant. 

[0034] The kernel size influences the operation of the channel equalizer. In 
particular, large kernel values result in a blind-equalization type operation mode, whereas 
small values yield a decision-directed operation mode. Although the decision-directed 
mode is capable of achieving a better ISI at its global optimum, it is more susceptible to 
suboptimal solutions and can get trapped at one such weight vector. Larger kernel sizes 
have a smoothing effect on the performance surface making the search for the global 
optimum much easier. 
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[0035] FIG. 2 is a graph illustrating the smoothing effect of the kernels in the 
operation of the channel equalizer in accordance with the inventive arrangements 
disclosed herein. As shown in FIG. 2, the criterion described in equation (6) has been 
evaluated for equalizer weight values in the interval of [0,2] in a single-tap equalization 
scheme for a 16-QAM constellation. The kernel function is a Gaussian distribution with 

standard deviation o4l . In the example shown in FIG. 2, the optimal solution is when 
the coefficient w =1 . 

[0036] FIG. 2 also demonstrates that the optimal solution obtained when operating in 
blind mode can be biased from the actual solution that is desired. In order to compensate 
for this, the constellation point locations can be redefined as scaled versions of the 
original values of the constellation point locations based on a scaling factor that is a 
function of the kernel size that is assumed. For example, if p = 2 and the complex- 
magnitudes of the constellation points are used as descriptors of the target probability 

distribution, a correction factor F(cf) must be applied such that | s? | 2 = F(a) \ s i | 2 . 
[0037] The modified constellation points then can be utilized in updating the channel 
equalizer instead of the original values. The function F(cr) depends on the criterion and 
the modulation scheme that is used and must be determined a priori through 
experimentation. Thus, FIG. 2 depicts the form of this function for the three criteria 
discussed above in the 16-QAM constellation case with Gaussian kernels. 
[0038] FIG. 3 is a graph illustrating the form of the correction factor function F(cr) 
for three different criteria in accordance with the inventive arrangements disclosed 
herein. More particularly, FIG. 3 illustrates F(a) for sampled PDF, matched PDF, and 
Quadratic distance. As noted, if the kernel size is kept constant at a large value, the 
channel equalizer will operate in blind mode. Decreasing the kernel size slowly and 
smoothly, on the other hand, facilitates a smooth transition from blind mode to a 
decision-directed mode in which much more accurate solutions are attainable. 
[0039] In order to facilitate automatic switching between the blind mode and the 
decision-directed mode, the kernel size can be coupled to decision error statistics. For 
example, in one embodiment, a recursive forgetting estimate of the decision-directed 
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mean-square-error as set forth in equation (13) can be utilized to guide the manipulation 
of kernel size through iterations. 



In one embodiment, the kernel size can be set to o k = aE k +b where a and b are also 

predetermined constants specific to the constellation. For example, in the 16-QAM case, 
suitable values for a and b can be 3.5 and -9.5 respectively. In another embodiment, a 
more general function o k = f (E k , 9) with parameter 0. Notably, in one embodiment, 0 

can be replace with parameters a and b. Though the linear relationship cr k = aE k + b is 

used in the various examples disclosed herein, it should be appreciated that the linear 
relationship has been provided as an illustration only. As such, it should not be construed 
as a limitation of the present invention. 

[0040] FIG. 4 is a flow chart illustrating a method 400 of applying blind equalization 
or decision-directed equalization to an input signal in accordance with one embodiment 
of the present invention. The method 400 can begin in a state where an input signal is 
received through a communication channel and equalization is to be applied to the 
received signal. The method 400 can begin in step 405 where an initial kernel size a k 

can be set that is indicative of blind equalization. As noted, larger kernel sizes indicate 
blind equalization while smaller kernel sizes indicate decision-directed equalization. 
Notably, step 405 also can include initializing a learning rate ju^, the error statistic, a 

forgetting factor a , as well as parameters a and b. As noted, parameters a and b are 
constants that vary according to the type of communication scheme in which the present 
invention is to be used. 

[0041] In step 410, blind equalization can be applied to the input signal. Blind or 
decision-directed equalization can be performed by multiplying the input signal with a 
vector w k of equalization coefficients. 

[0042] In step 415, an error statistic can be evaluated. The error statistic can be a 
recursive forgetting estimate of the mean-square error. For example, the recursive 
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forgetting estimate of the mean-square error can be denoted as E k and evaluated 



factor, y k is the equalized signal, and s t is derived from a constellation. 

[0043] In step 420, an updated kernel size can be calculated. The updated kernel size 

can be a function of the error statistic. The kernel size, denoted as a k , can be calculated 

according to a k = aE k +6 or other function as disclosed herein. Notably, the learning 
rate also can be adjusted. 

[0044] In step 425, blind equalization or decision-directed equalization can be 
selectively applied to the input signal according to the updated kernel size. For example, 
the vector of equalization coefficients w k can be updated and applied to the input signal. 

More particularly, w k can be updated according to w k+l =w A ±/i a V Vf J(w k ), where 
J(w k ) is the matched probability density function or the sampled probability density 
function criterion, V w is the stochastic gradient, and is the learning rate modified 
according to the current value of the kernel size. 

[0045] The method 400 can repeat as necessary. As the method continues to repeat, 
the method converges on decision-directed equalization as the error statistic and kernel 
size continue to decrease. That is, the method 400 begins to automatically apply 
decision-directed equalization as the error statistic, and also the kernel size, begins to 
decrease in size. 

[0046] FIG. 5 is a flow chart illustrating method 500 of selectively applying blind 
equalization or decision-directed equalization to an input signal in accordance with 
another embodiment of the present invention. The method 500 can begin in a state where 
an input signal is received through a communication channel and equalization is to be 
applied to the received signal. In step 505, the channel equalizer can be initialized. More 
particularly, the channel equalizer coefficients w^, the learning rate ju a , the error 

estimation rate E k and the forgetting factor a can be set to initial values. In step 510, the 

parameter 6, or in another embodiment parameters a and b, can be initialized. 
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[0047] In step 515, an input signal x k can be received. In step 520, channel 
equalization can be applied to the received input signal. That is, an output signal y k can 
be calculated as y k = yv k x k . 

[0048] In step 525, the kernel size and the learning rate can be determined. The 
kernel size can be set or updated using the formula a k = f(E k ,&) or o k =aE k + 6, 
where 6, a, and b are constants as noted and E k is the error estimation from the previous 

(or initial) iteration. The error learning rate can be set to ju a = //cr , where 0 < ju « 1 is 

a small fixed constant. In another embodiment, however, ju can be allowed to vary in 
time. 

[0049] In step 530, the correction factor F(or) can be determined. As the function 
F{(7) is calculated a priori, the correction factor can be determined from a lookup table or 
other data structure storing such values in memory. In step 535, the equalizer weight can 
be updated as follows: w k+] =w* ±ju a V w J(w k ) . The method 500 can repeat as 

necessary. As the method 500 continues to iterate, the equalization applied to the input 
signal moves from blind to decision-directed. As noted, method 500 begins to 
automatically apply decision-directed equalization as the error statistic, and also the 
kernel size, begins to decrease in size. 

[0050] The present invention provides an equalization solution that does not require 
training data and, in consequence, utilizes less bandwidth than supervised equalizers. 
The inventive arrangements disclosed herein converge upon a solution using fewer 
samples, i.e. approximately 2,000-3,000 samples, thereby attaining increased data- 
efficiency. Further, the present invention provides automatic switching to decision- 
directed equalization. The aforementioned attributes of the inventive arrangements 
disclosed herein are provided without adding computational cost. 

[0051] The present invention can be realized in hardware, software, or a combination 
of hardware and software. The present invention can be realized in a centralized fashion 
in one computer system or in a distributed fashion where different elements are spread 
across several interconnected computer systems. Any kind of computer system or other 
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apparatus adapted for carrying out the methods described herein is suited. A typical 
combination of hardware and software can be a general-purpose computer system with a 
computer program that, when being loaded and executed, controls the computer system 
such that it carries out the methods described herein. 

[0052] The present invention also can be embedded in a computer program product, 
which comprises all the features enabling the implementation of the methods described 
herein, and which when loaded in a computer system is able to carry out these methods. 
Computer program in the present context means any expression, in any language, code or 
notation, of a set of instructions intended to cause a system having an information 
processing capability to perform a particular function either directly or after either or both 
of the following: a) conversion to another language, code or notation; b) reproduction in 
a different material form. 

[0053] This invention can be embodied in other forms without departing from the 
spirit or essential attributes thereof. Accordingly, reference should be made to the 
following claims, rather than to the foregoing specification, as indicating the scope of the 
invention. 
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